Picture for Yi Shen

Yi Shen

Enhancing Cooperative Multi-Agent Reinforcement Learning with State Modelling and Adversarial Exploration

Add code
May 08, 2025
Viaarxiv icon

Quantitative Analysis of Performance Drop in DeepSeek Model Quantization

Add code
May 05, 2025
Viaarxiv icon

DAST: Difficulty-Adaptive Slow-Thinking for Large Reasoning Models

Add code
Mar 06, 2025
Viaarxiv icon

Improve Decoding Factuality by Token-wise Cross Layer Entropy of Large Language Models

Add code
Feb 05, 2025
Viaarxiv icon

Hypercube Policy Regularization Framework for Offline Reinforcement Learning

Add code
Nov 07, 2024
Viaarxiv icon

Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent

Add code
Nov 05, 2024
Figure 1 for Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent
Figure 2 for Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent
Figure 3 for Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent
Figure 4 for Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent
Viaarxiv icon

Inverse Reinforcement Learning from Non-Stationary Learning Agents

Add code
Oct 18, 2024
Figure 1 for Inverse Reinforcement Learning from Non-Stationary Learning Agents
Figure 2 for Inverse Reinforcement Learning from Non-Stationary Learning Agents
Figure 3 for Inverse Reinforcement Learning from Non-Stationary Learning Agents
Figure 4 for Inverse Reinforcement Learning from Non-Stationary Learning Agents
Viaarxiv icon

Distributionally Robust Clustered Federated Learning: A Case Study in Healthcare

Add code
Oct 09, 2024
Figure 1 for Distributionally Robust Clustered Federated Learning: A Case Study in Healthcare
Figure 2 for Distributionally Robust Clustered Federated Learning: A Case Study in Healthcare
Figure 3 for Distributionally Robust Clustered Federated Learning: A Case Study in Healthcare
Viaarxiv icon

Towards Efficient Moion Planning for UAVs: Lazy A* Search with Motion Primitives

Add code
Oct 02, 2024
Figure 1 for Towards Efficient Moion Planning for UAVs: Lazy A* Search with Motion Primitives
Figure 2 for Towards Efficient Moion Planning for UAVs: Lazy A* Search with Motion Primitives
Figure 3 for Towards Efficient Moion Planning for UAVs: Lazy A* Search with Motion Primitives
Figure 4 for Towards Efficient Moion Planning for UAVs: Lazy A* Search with Motion Primitives
Viaarxiv icon

Geometric Analysis of Unconstrained Feature Models with $d=K$

Add code
Jul 15, 2024
Viaarxiv icon